The impact of Docker containers on the performance of genomic pipelines

Authors

  • Paolo Di Tommaso
  • Emilio Palumbo
  • Maria Chatzou
  • Pablo Prieto
  • Michael L. Heuer
  • Cedric Notredame
  • Fabien Campagne
Abstract

Genomic pipelines consist of several pieces of third-party software and, because of their experimental nature, frequently require changes and updates, which raises serious deployment and reproducibility issues. Docker containers are emerging as a possible solution for many of these problems, as they allow pipelines to be packaged in an isolated and self-contained manner. This makes it easy to distribute and execute pipelines portably across a wide range of computing platforms. The question that arises is to what extent the use of Docker containers might affect the performance of these pipelines. Here we address this question and conclude that Docker containers have only a minor impact on the performance of common genomic pipelines, which is negligible when the executed jobs are long in terms of computational time.
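The performance question raised in the abstract can be framed as a comparison of wall-clock times for the same task run natively and inside a container. The following is a minimal sketch of such a comparison, not the authors' benchmarking method: the helper names `time_command` and `slowdown` are hypothetical, and the image and tool names in the comments are placeholders.

```python
import subprocess
import sys
import time


def time_command(args: list[str]) -> float:
    """Run a command once and return its wall-clock time in seconds."""
    start = time.perf_counter()
    subprocess.run(args, check=True, stdout=subprocess.DEVNULL)
    return time.perf_counter() - start


def slowdown(native_seconds: float, container_seconds: float) -> float:
    """Ratio of containerized to native run time; 1.0 means no measurable overhead."""
    if native_seconds <= 0:
        raise ValueError("native_seconds must be positive")
    return container_seconds / native_seconds


if __name__ == "__main__":
    # Stand-in workload: starting the Python interpreter. In a real comparison
    # the first call would run the pipeline tool directly and the second would
    # wrap it in a container, for example (hypothetical image and tool names):
    #   ["docker", "run", "--rm", "my-pipeline-image", "my_tool", "input.fa"]
    native = time_command([sys.executable, "-c", "pass"])
    repeat = time_command([sys.executable, "-c", "pass"])
    print(f"ratio between two identical runs: {slowdown(native, repeat):.2f}")
```

A single run is noisy, so a real measurement would repeat each command several times and compare means or medians. A ratio close to 1.0 for long-running jobs would be consistent with the abstract's conclusion.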

Similar resources

The impact

Genomic pipelines consist of several pieces of third party software and, because of their experimental nature, frequent changes and updates are commonly necessary thus raising serious distribution and reproducibility issues. Docker containers technology offers an ideal solution, as it allows the packaging of pipelines in an isolated and self-contained manner. This makes it easy to distr...

NGSeasy: a next generation sequencing pipeline in Docker containers [version 1; referees: 3 approved with reservations]

Motivation: Bioinformatic pipelines often use large numbers of components, and deploying them incurs substantial configuration and maintenance burden that remains a significant barrier to reproducible research. Our aim is to define a new paradigm and best practices for developing, distributing and running pipelines encapsulated in Docker containers (lightweight virtualization), with a focus on ne...

Performance Analysis of Containerized Applications on Local and Remote Storage

Docker containers are becoming the mainstay for deploying applications in cloud platforms, having many desirable features like ease of deployment, developer friendliness, and lightweight virtualization. Meanwhile, storage systems have witnessed tremendous performance boost through recent innovations in the industry such as Non-Volatile Memory Express (NVMe) and NVMe Over Fabrics (NVMf) standard...

About Microservices, Containers and their Underestimated Impact on Network Performance

Microservices are used to build complex applications composed of small, independent and highly decoupled processes. Recently, microservices are often mentioned in one breath with container technologies like Docker. That is why operating system virtualization experiences a renaissance in cloud computing. These approaches shall provide horizontally scalable, easily deployable systems and a high-p...

How does Docker affect energy consumption? Evaluating workloads in and out of Docker containers

Context: Virtual machines provide isolation of services at the cost of hypervisors and more resource usage. This spurred the growth of systems like Docker that enable single hosts to isolate several applications, similar to VMs, within a low-overhead abstraction called containers. Motivation: Although containers tout low overhead performance, do they still have low energy consumption? Methodolo...

Journal title:

Volume 3, Issue

Pages -

Publication date: 2015